Dataset Examples with Highlighted Detection Mask
Dataset repe_honesty__plain_val, n=122
Dataset ai_liar__original_without_answers, n=270
Dataset roleplaying__plain, n=371
Dataset insider_trading__upscale, n=241
Dataset insider_trading_doubledown__upscale, n=105
Dataset sandbagging_v2__wmdp_mmlu, n=1000
Dataset alpaca__plain, n=9994